The weighted random graph model 
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We introduce the weiglited random grapli (WRG) model, which represents the weighted counter- 
part of the Erdos-Renyi random graph and provides fundamental insights into more complicated 
weighted networks. We find analytically that the WRG is characterized by a geometric weight 
distribution, a binomial degree distribution and a negative binomial strength distribution. We also 
characterize exactly the percolation phase transitions associated with edge removal and with the 
appearance of weighted subgraphs of any order and intensity. We find that even this completely null 
model displays a percolation behavior similar to what observed in real weighted networks, implying 
that edge removal cannot be used to detect community structure empirically. By contrast, the 
analysis of clustering successfully reveals different patterns between the WRG and real networks. 



89.75.Hc,02.10.Ox,02.50.-r 

The Erdos-Renyi (ER) random graph [1] is the proto- 
type of all unweighted network models: in a graph with 
N vertices, an unweighted edge is drawn independently 
between any pair of vertices with equal probability p. 
The ER model provides a fundamental reference for the 
properties of real networks, whose global properties are 
in most cases expected to arise from surely more compli- 
cated, but nonetheless local, decentralized mechanisms. 
Indeed, the explosion of interest towards complex net- 
works [2] originates precisely because of the striking dif- 
ference between the observed properties of real networks 
and the behaviour of the ER model. For this reason, vir- 
tually any model of unweighted networks is an extension 
or modification of the ER random graph. One important 
example is the class of hidden- variable or fitness network 
models [3,4], where the connection probability is assumed 
to be no longer a constant p for all pairs of vertices, but 
a function pij = p{xi,Xj), where Xi represents a variable 
(fitness) associated with vertex i. Both the analytical 
treatment and the intuitive understanding of such more 
complicated models strongly rely on well-established re- 
sults for the ER random graph, which thus remains at 
the basis of whole network theory. 

Recently, more and more results indicate that a com- 
plete and balanced description of complex networks is 
only possible if the full degree of heterogeneity present in 
edge weights is taken into account [5]. Even for purely 
unweighted graphs, edge weights naturally emerge as dy- 
namical properties, when transport, random walks or 
other processes take place on the network [6,7]. This has 
triggered a series of studies aimed at generalizing con- 
cepts and quantities originally developed for unweighted 
graphs to weighted networks [5,8-14]. Quite surprisingly, 
despite this rapidly evolving context, the studies devoted 
to the most basic and fundamental graph model - the ER 
model - have not been paralleled by analogous efforts to 
understand weighted networks by comparison with a sim- 
ple prototype model. Indeed, all the weighted network 
models that have been proposed fall in two broad classes: 
evolving models [15,16] aimed at reproducing the empir- 
ical properties of real networks through generalizations 



of unweighted growth rules [17,18], and weighted ensem- 
bles obtained by keeping some property of a real network 
fixed, and randomizing the graph in any other respect 
[11-13,19]. Both classes retain some of the heteroge- 
neously distributed properties of real weighted networks, 
either as the outcome or as the input, and therefore do 
not represent a completely homogeneous null model, as 
the ER is for unweighted graphs. Therefore, while these 
models are important for various reasons, they do not 
allow to decouple two possible sources of heterogeneity. 
To see this, note that the properties of interest in real 
weighted networks are defined as generalizations of their 
unweighted counterparts [5,8-10,12-14]. At present, it 
is still unclear whether the behaviour observed at the 
weighted level in real networks arises simply because of 
the broad range of allowed weight values (a possibility 
that does not apply for unweighted networks), or be- 
cause of deeper tendencies and correlations, as empiri- 
cally found at the unweighted level. A weighted counter- 
part of the ER random graph would exhibit only the first 
type of heterogeneity, thus representing a novel reference 
in order to correctly interpret the observed behaviour 
of weighted networks. It would also represent the basis 
for separately introducing the second type of heterogene- 
ity, since the recently identified correct null models for 
weighted graphs [19], that allow to preserve features of 
real networks, can be redefined as sophistications of a 
homogenous weighted model. 

We now introduce a model with the above character- 
istics, and denote it the weighted random graph (WRG). 
We follow a straightforward analogy with the unweighted 
case. The ER random graph can be obtained as a 
particular case of the configuration model, represent- 
ing the maximally random ensemble of unweighted net- 
works with specified degrees [20,21]. The latter can 
be described analytically by the connection probability 
Pij = XiXj /{I + XiXj), where each parameter Xi allows 
to control the expected degree (fc^) — J2j^iPij vertex 
i. This ensures that all graphs with the same degree se- 
quence {ki}fLi are equiprobable [21]. If all vertices are 
assigned the same value Xi — xq, the above probability 
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becomes a constant p = Xq/{1 + Xq) and the ER model 
(where all vertices have the same expected degree) is re- 
covered. In such a way, all graphs with the same number 
of links L become equiprobable. We define the WRG fol- 
lowing an analogous derivation. In a weighted network, 
the strength of a vertex i is defined as Si = J2j^i '^ij ' 
where Wij is the weight of the edge connecting i and j. 
The weighted analogue of the configuration model is the 
maximally random ensemble of weighted networks with 
specified strengths [11-13,19]. Recently, it was shown [19] 
that this ensemble can be generated by drawing an edge 
of weight w between vertices i and j with probability 

<l^jiw) = (y»2/j)'"(i - Vtyj) (1) 

where yi < 1 allows to tunc the expected strength (si) 
of vertex i. The case w — corresponds to no edge 
being drawn. The above expression allows w to take 
discrete, integer values in the range [0,-|-oo). If a finite 
maximum allowed weight Wmax is considered, the above 
expression must be divided by 1 — {yiyj)^"^"''^^ , which 
however rapidly converges to 1 as Wmax increases. In 
this model, all weighted graphs with the same strength 
sequence {si}fli ^'^^ equiprobable [19]. 

In analogy with the ER model, we define the WRG 
as a weighted graph where all vertices are statistically 
equivalent, or = yo < 1 Vi. Setting p = (we will 
justify this choice in a moment) and assuming for sim- 
plicity Wmax = -f-cxD, wc obtain 

q{w)=p^{l-p) (2) 

as the probability that any two vertices are joined by an 
edge of weight w. This completely specifies the model. 
A demonstration allowing to generate small weighted 
graphs according to the model is available [22]. 

Note that g(0) is the probability that no edge is drawn. 
Therefore 1 — q{Q) — p \s the probability of an edge of 
any (nonzero) weight. If we project the weighted net- 
work onto an unweighted graph, we therefore obtain an 
ER random graph with connection probability p, justify- 
ing the choice of the symbol. However, for the purpose of 
fitting the model to a real network, the Maximum Like- 
lihood principle indicates different parameter choices in 
the two cases. Using the results in ref. [23], for the ER 
model the optimal parameter choice is p* — 2L/N{N— 1) 
where L is the number of links in the real network, while 
for the WRG it is easy to show that the likelihood- 
maximizing choice is 



^ N{N - 1) + 2W ^ ^ 

where W = X]i<j ^^e sum of edge weights in the 

real network. This is the optimal criterion to be used in 
order to tune p in the WRG, all other choices resulting 
in a decreased likelihood [23]. In particular, one should 
not be tempted to use p in order to reproduce the num- 
ber of links L of the purely topological projection as for 
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FIG. 1. An incomplete list of equiprobable graphs with 
iV = 4 vertices in the WRG. 

the ER model. This refiects the fact that in the WRG 
model the equiprobable realizations are those with the 
same total weight W, not those with the same number of 
links L. To illustrate this point, fig.l shows four graphs 
that are equiprobable in the WRG, since they have the 
same number of vertices {N = 4) and the same total 
weight {W = 6). The graph A has I = 2 weighted edges 
of weight w = 3, B and C have I = 3 edges of weight 
w = 2, and D has I = 6 edges of weight w = 1. Note that 
the unweighted projections of these graphs have different 
numbers of links L. 

The form of q{w) appearing in eq.(2) coincides with 
the so-called geometric distribution with parameter 1 —p. 
The latter represents the probability that, in a series of 
independent Bernoulli trials with success probability p, w 
successes occur before a failure occurs for the first time. 
Thus the WRG is completely equivalent to the following: 
select one pair of vertices, and start a series of Bernoulli 
trials with success probability p. Each success implies the 
formation of a link of unit weight between the same two 
vertices, w multiple successes correspond to a weighted 
edge of weight w. As soon as a failure occurs for the 
first time, the sequence of trials stops and a new pair 
of vertices is selected. The process is repeated until all 
pairs have been considered. Indeed, this procedure rep- 
resents a fast and convenient way to generate the WRG 
my means of numerical simulations [22]. Note that we 
are considering an undirected weighted model. The ex- 
tension to the directed case is straightforward. 

We now derive a series of exact results for the WRG. 
Since all weights are independently drawn from q{'w), it 
follows that the weight distribution P{w) is simply 

P{w) = q{w) (4) 

as confirmed in fig. 2. The expected weight of any edge is 

ui=0 ^ 

and the variance is 

(-^) - {wf = (6) 

Note that, for networks approaching the fully connected 
topology (p — > 1), the mean and variance of the weight 
distribution diverge. 

At a purely topological level, the expected degree of 
any vertex is {k) = {N — l)p as, in the ER model, and 
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FIG. 2. Weight distribution in the WRG. For each choice of 
the parameter p, isolated points correspond to the histogram 
P{w) of edge weights in numerical simulations of the model 
with A'^ = 1000 vertices, and solid lines represent the theoret- 
ical curve q{w) obtained for the same value of p. 



the probability P{ki = k) that the degree of vertex i 
equals the value k follows a binomial distribution with 
parameters iV — 1 and p. The usual approximation for 
the ER model is to neglect correlations between vertices, 
which implies that P{ki — k) coincides with the degree 
distribution P{k) of the whole network: 



P{k) 



N -I 
k 



/(1-p) 



N-l-k 



(7) 



The mean is {N — l)p and the variance is {N — l)p{\ —p). 

We now consider the strength distribution P{s). The 
expected strength of any vertex is (s) = (iV — l){w) ~ 
{N — l)p/ {1—p). Using the same approximation leading 
to eq.(7) (i.e. neglecting correlations between vertices), 
the strength distribution P{s) of the whole network co- 
incides with the probability P{si — s) that Si equals the 
particular value s. Note that Si = '^ij ^ sn-m 

of iV — 1 independent geometrically distributed variables 
with parameter 1 — p, and its distribution P{si — s) is 
known as the negative binomial distribution with param- 
eters — 1 and 1 — p, and reads: 



P{s) 



N ~2 + s 
N -2 



f{l~p) 



N-l 



(8) 



The mean is (N — l)p/{l — p) and the variance is 
(iV — l)p/{l — pY ■ The degree and strength distribu- 
tions are shown in fig. 3 in cumulative form. Numerical 
simulations are in perfect accordance with the analytical 
results. Note that P{k) « P{s) when p « 0, while the 
two distributions become increasingly different as p ^ 1 : 
the degrees are distributed with vanishing variance about 
the average value (fc) = N — \ (fully connected topology) , 
while the strenghts are distributed with diverging vari- 
ance about a diverging (even for finite N) average value. 

An important property of weighted networks is their 
resilience against targeted edge removal [16,24-26]. One 
is interested in studying the behaviour of the largest con- 
nected component (LCC) and the clustering coefficient in 
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FIG. 3. Cumulative degree and strength distributions in 
the WRG. The four panels correspond to different choices of 
the parameter p. In each panel, points correspond to the 
cumulative histograms of vertex degrees (left) and strengths 
(right) in numerical simulations of the model with A*' = 1000 
vertices, and curves represent the theoretical cumulative dis- 
tributions P>(k) (solid) and P>{s) (dashed). 

the unweighted projection under progressive deletion of 
edges, in increasing or decreasing weight order. In real 
networks, one observes that the LCC is surprisingly ro- 
bust under strong link removal, and very fragile under 
weak link removal [24-26]. In particular, for weak link 
removal a percolation phase transition where the LCC 
fragments abruptly at a finite critical weight is observed, 
while for strong link removal no phase transition is ob- 
served. This behaviour is often taken as a signature of 
community structure: the network is interpreted to be 
organized in communities, with strong intra-community 
edges and weak inter-community ones [16,24-26]. We 
now consider the resilience of the WRG, that can be stud- 
ied exactly. If all edges with weight smaller than w are 
removed (weak link removal) , it is easy to see that the re- 
maining edges form an unweighted projection equivalent 
to an ER random graph with connection probability 
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q{v) ^ p- 



(9) 



Similarly, if all edges with weight greater than w are re- 
moved (strong link removal), the unweighted projection 
will be an ER random graph with probability 



Pn 



J2q{v)=p-p'^+' 



(10) 



v=l 



Thus, in both cases we can exploit a well known result for 
the ER model [1] to obtain the equation obeyed by the 
fractions 5*^ and S~ of vertices in the LCC after weak 
or strong link removal (up to the value w) respectively: 

5± = l-exp(-z±5±) (11) 

where = {N — l)p^ is the average degree after edge 
removal. The theoretical value of 5*^ is the largest so- 



3 



1.2 
1.0 


Weak link removal -> 


<- Strong link removal 










0.8 


• p = 0.2 \ 


\ 




1 




+1 > n fi 


- . p = 0.4 










0.4 


' p = 0.6 










0.2 


- • p = 0.8 




I.... 






0.0 








I** 









10 



20 



50 



FIG. 4. Fraction of vertices spanned by the LCC after weak 
(si, solid, left to right) and strong (S", dashed, right to left) 
link removal. Symbols: numerical simulations with A'^ — 1000; 
curves: theoretical results. 
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FIG. 5. Average clustering coefficient after weak (C^, 
solid, left to right) and strong {C^ , dashed, right to left) 
link removal. In both cases, four choices of p are considered: 
from top to bottom, p — 0.8, p — 0.6, p — 0.4, and p = 0.2. 



lution to the above equation, which is easily obtained 
numerically. In fig. 4 we show that such solution is per- 
fectly confirmed by numerical simulations. For weak link 
removal, as in the ER model a percolation phase tran- 
sition is observed at the critical probability p+ ~ 1/-^^, 
corresponding to the critical weight 



Inp 



(12) 



When edges are removed in increasing weight order, the 
LCC will span a finite fraction of the N vertices as long as 
edges with weight w < w'^ are removed. As soon as edges 
with weight or larger are removed, the network will 
fragment into many small connected components. For a 
finite network with < p < 1, w'^ is always finite and 
nonzero, as shown in fig. 4. For infinite networks, w'^ 
depends on how p scales with N. In particular 



lim w'^ — — 

N^oo a 



if N- 



(13) 



where < a < 1 (note that a > 1 would imply the 
absence of the giant component already in the original 
network) . For strong link removal, the critical probability 
p~ ~ 1 /N defines the critical weight 



Hp-1IN)_^^^ 
h\p 



(14) 



which is always zero, both for large but finite networks 
with < p < 1 and for infinite networks, independently 
of how p scales with N . Therefore we find the surprising 
result that the strongest links are completely inessential 
to the robustness of the network: the giant component 
is preserved until the weakest edges are removed, and no 
phase transition is observed for finite w. As we men- 
tioned, real networks display exactly such striking differ- 
ence between weak and strong link removal. However, 
since this behaviour is displayed even by our completely 
random model, where communities are clearly absent, we 



find that the differences between strong and weak link re- 
moval are not a signature of community structure in real 
networks, in constrast with usual interpretations [24-26]. 

Opposite considerations apply to the clustering coef- 
ficient. After weak or strong link removal, the aver- 
age clustering coefficient of the unweighted projection 
simply equals the link density p^. In fig. 5 we plot 
= as a function of w for weak (from left to right) 
and strong (from right to left) link removal. We find a 
convex, rapidly decaying curve in the former case, and a 
concave, slowly decreasing curve in the latter case. Re- 
markably, this is opposite to what is observed for real 
weighted networks [25,26], where weak link removal re- 
sults in a concave, slowly decaying curve and strong link 
removal in a convex, rapidly decaying curve. The strik- 
ing difference arises because triangles and weights are lo- 
cated uniformly in the WRG, while large- weight triangles 
are located mainly within communities in real networks. 
Therefore, unlike the LCC, the clustering coefficient sig- 
nals community structure successfully. 

The above results can be easily generalized to the ap- 
pearance of weighted subgraphs of any order and inten- 
sity. Detecting weighted subgraphs in larger networks 
is an important and stimulating problem [27]. The in- 
tensity i{g) of a weighted subgraph g has been defined 
as the geometric mean of its edge weights [9]. In an 
ER random graph with N vertices, a subgraph of k ver- 
tices and / links appears almost surely when p scales at 
least as A^"*"'/' [1]. This allows to solve the correspond- 
ing weighted problems: when does a subgraph with k 
vertices and / weighted edges, each of weight at least (or 
at most) w, appear almost surely in a WRG of A^ ver- 
tices? The edges forming such subgraphs are drawn with 
probability p^ , respectively. Therefore subgraphs with k 
vertices and I weighted edges, each of weight at least w 
or at most w appear almost surely if 



P > N-''/^ 



(15) 
(16) 
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respectively. Note that these subgraphs of order k have 
an intensity i > w in the former case and i < w (with 
w > 0) in the latter case. In particular, one may be 
interested in the appearance of loops, which in the un- 
weighted case play an important topological role [28]. 
Since loops of order k are special subgraphs with I = k 
edges, weighted loops of intensity i > w appear when 
p > iV~^/"', and of intensity i <w appear whenp > A^~^ 
independently of w {ifw > 0). In both cases, loops of any 
order and with the same intensity appear simultaneously, 
as in the unweighted case. The former condition confirms 
that, in order to have a giant component (which is char- 
acterized by loops of all orders) made of edges of weight 
at least w, one must have p > N~'^/^ . Similarly, the ap- 
pearance of weighted loops confirms the behaviour under 
edge removal of the clustering coefficient C^, which is 
contributed by triangles (loops of order 3). 

We have introduced the WRG as the weighted con- 
terpart of the ER random graph, and derived many of 
its properties exactly. The WRG displays the weighted 
properties that are merely due to the intrinsic variability 
in edge weights, and not to true correlations. There- 
fore it is a fundamental reference for the analysis and in- 
terpretation of the properties of real weighted networks. 
Sophistications of the model, in particular allowing dif- 
ferent edges to be governed by different parameters as 
in eq.(l), allow to extend the model to power-law dis- 
tributed strengths or degrees [19]. 
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